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^-MEDIATm-GTO^onTFTr^TTQM tm MAMMftT.TAN CEX JjS . 
AND COMPOSITIONS AND CELLS USEFUL THEREFOR 



This invention relates to recombinant DNA 
technology. In a particular aspect, this invention 
relates to methods for the site-specific recombination of 
DNA in mammalian cells or host mammalian organisms. In 
another aspect, the present invention relates to novel 
DNA constructs, as well as compositions, cells and host 
organisms containing such constructs. In yet another 
aspect, the present invention relates to methods for the 
activation and/or inactivation of expression of 
functional genes. In a further aspect, the present 
invention relates to methods for the introduction of DNA 
into specific sites in the genome of mammalian cells. In 
a still further aspect, the present invention relates to 
gene therapy methods. In still another aspect, the 
present invention relates to means for the recovery of 
transfected DNA from a cell or host organism. In a still 
further aspect, the present invention relates to assay 
methods . 



2 0 BACKGROUND OF THE INVENTION 

Many recent manipulations of gene 
expression involve the introduction of transfected genes 
(transgenes) to confer some novel property upon, or to 
alter some intrinsic property of, mammalian cells or 

25 organisms. The efficacy of such manipulations is often 

impaired by such problems as the inability to control the 
chromosomal site of transgene integration; or the 
inability to control the number of copies of a transgene 
that integrate at the desired chromosomal site; or by 

30 difficulties in controlling the level, temporal 

characteristics, or tissue distribution of transgene 
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expression; or by the difficulty of modifying the 
structure of transgenes once they are integrated into 
mammalian chromosomes. 

Transgenes are often introduced into 
5 mammalian cells or organisms to determine which 

components of a transgene are required for specific 
qualitative or quantitative alterations of the host 
system. Since both chromosomal position and copy number 
are major determinants of transgene function, the 
, 10 usefulness of these analyses is limited because current 
techniques for efficiently introducing transgenes into 
mammalian hosts result in the insertion of a variable 
number of transgene copies at random chromosomal 
positions. It is, therefore, difficult (if not 

15 impossible) to compare the effects of one transgene to 
those of another if the two transgenes occupy different 
chromosomal positions and are present in the genome at 
different copy numbers. Considerably more refined 
analyses would be possible if one could routinely 

2 0 introduce single copies of a variety of transgenes into a 
defined chromosomal position. 

The spatial or temporal characteristics of 
transgene expression is difficult to control in intact 
organisms. The restricted expression of transgenes is 

25 potentially of great interest, as this technique can be 

employed for a variety of therapeutic applications, e.g., 
for the selective interruption of a defective gene, for 
the alteration of expression of a gene which is otherwise 
over-expressed or under-expressed, for the selective 

30 introduction of a gene whose product is desirable in the 
host, for the selective removal or disruption of a gene 
whose expression is no longer desired in the host, and 
the like. 

Transgene expression is typically governed 
35 by a single set of control sequences, including promoters 
and enhancers which are physically linked to the 




4 

-3- 

transgenes (i.e., cis-acting sequences). Considerably 
greater expression control could be achieved if transgene 
expression could be placed under the binary control of 
these cis-acting sequences, plus an additional set of 
5 sequences that were not physically linked to the 

transgenes (i.e., trans-acting sequences). A further 
advantage would be realized if the transient activity of 
these trans-acting functions resulted in a stable 
alteration in transgene expression. In this manner, it 

10 would be possible, for example, to introduce into a host 
a transgene whose expression would have lethal or 
deleterious effects if it was constitutively expressed in 
all cells. This would be accomplished by delaying the 
expression of the transgene to a specific time or 

15 developmental stage of interest, or by restricting the 
expression of the transgene to a specific subset of the 
cell population. 

It is currently difficult (if not 
impossible) to precisely modify the structure of 

2 0 transgenes once they have been introduced into mammalian 
cells. In many applications of transgene technology, it 
would be desirable to introduce the transgene in one 
form, and to then be able to modify the transgene in a 
defined manner. By this means, transgenes could be 

25 activated or inactivated or the sequences which control 
transgene expression could be altered by either removing 
sequences present in the original transgene or by 
inserting additional sequences into the transgene. 

Previous descriptions of recombinase- 

30 mediated rearrangement of chromosomal sequences . in 
Drosophila and mammalian cells have not directly 
addressed the question of whether site-specific 
recombinases could routinely create a functional 
translational reading frame. Moreover, the reported 

35 efficiency of the prior art recombinase system, in the 

only other description of site-specific recombination in 
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mammalian cells reported to date [based on Cre 
recombinase, described by Sauer and Henderson in Nucleic 
Acids Research . Vol. 17: 147 (1989)] appears to be quite 
low. 

5 

BRIEF DESCRIPTION OF THE INVENTION 

In accordance with the present invention, 
we have developed a system for the selective modification 
of chromosomal or extrachromosomal DNA in mammalian 

10 cells. Selective modification can involve the insertion 
of one DNA into another DNA (e.g., to create a hybrid 
gene, to activate a gene, to inactivate a gene, and the 
like), or the removal of specific DNA molecule(s) from 
other DNA molecule (s) containing the DNA to be removed 

15 (e.g., to inactivate a gene, to bring desired DNA 

fragments into association with one another, and the 
like) . 

The recombination system of the present 
invention is based on site-specific recombinase, FLP. In 

20 one application of the invention recombination system, 

FLP-mediated removal of intervening sequences is required 
for the formation of a functional gene. Expression of 
the functional gene therefore, falls under the control of 
both the regulatory sequences associated with the 

25 functional gene and also under the control of those 
sequences which direct FLP expression. 

The reverse of the above-described 
process, i.e., the FLP-mediated introduction of DNA, 
provides a convenient and selective means to introduce 

30 DNA into specific sites in mammalian chromosomes. 

FLP-mediated recombination of marker genes 
provides a means to follow the fate of various sequences 
over the course of development and/or from generation-to- 
generation. The recombination event creates a functional 

35 marker gene. This gain-of-f unction system can be used 
for lineage analyses in a wide variety of tissues in 
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different organisms. Prior to FLP-mediated 
recombination, the marker gene is normally silent, i.e., 
the phenotype typical of the marker is not observed. In 
the absence of FLP, spontaneous recombination to produce 
5 functional marker occurs only at very low frequencies. 
In the presence of FLP, functional marker is efficiently 
produced. In addition, this gain-of -function system is 
heritable and is easily detected by simple histochemical 
assays. For example, in transgenic mice, the lineages in 

10 which recombination is to occur can be controlled by 

appropriate selection of the promoters used to drive FLP 
expression. This could include promoters that are only 
transiently active at a developmental stage that 
substantially precedes overt cell differentiation. Since 

15 transcription of the marker gene is controlled by 

regulatory sequences associated therewith, functional 
marker genes can be expressed at later developmental 
stages, after cell differentiation has occurred. By this 
means, it is possible to construct a map for mammalian 

20 development that correlates embryonic patterns of gene 
expression with the organization of mature tissues. 

BRIEF DESCRIPTION OF THE FIGURES 
^ p, 3^ 1ft a^dl&^Figure 1 presents schematic diagrams of 

v^L^p^ 2 5 FLP-mediated recombination events. In Figure 1A, FLP- 

mediated introduction of DNA is illustrated, while in 
Figure IB, FLP-mediated removal of intervening sequences 
is illustrated. 

fci)to*ot* y 2.B y and2C, Figure 2 is presented in three parts. 
^Jjfc®^ 30 Figure 2A presents schematic diagrams of the expression 
X * vectors pFRT/?GAL, pNEO/3GAL, and pOG44 FLP. Figure 2B 

presents a Southern blot of Hirt lysates prepared from 
293 (human embryonic kidney) cells transfected with one 
microgram of pNEO/3GAL and varying amounts of the pOG4 4 
35 FLP expression vector. Figure 2C graphically presents 
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25 



30 



the /3-galactosidase activities in the same transf ections 
shown in part B, referred to above. 



ptyiwrti 3ft-c\rtd3|. Figure 3A, at the top, presents a 
schematic of the pattern of plasmid integration in E25 
deduced from Southern blot analysis. Figure 3A, in the 
middle, presents the predicted pattern for 
/?-galactosidase positive subclones of E25 if precise 
recombination across the FLP-recombination target sites 
occurs. Figure 3A, at the bottom, presents the predicted 
pattern for /?-galactosidase negative, neomycin resistant 
subclones of E25B2 after FLP mediated insertion of pOG45. 
Figure 3B presents an analysis of genomic DNA from a cell 
line with a single integrated copy of pNE0/3GAL (i.e., 
CVNEO/3GAL/E2 5, designated as E25) , two derivative 
j0-galactosidase-positive subclones (designated as E25B1 
and E25B2), and two subclones derived from E25B2 after 
transfection with pOG45 (designated as B2N1 and B2N2) . 

DETAILED DESCRIPTION OF THE INVENTION 

In accordance with the present invention, 
there is provided a mammalian recombination system 
comprising: 



(i) FLP recombinase, or a nucleotide sequence 
encoding same, and 

(ii) a first DNA comprising a nucleotide 
sequence containing at least one FLP 
recombination target site. 

In accordance with another embodiment of 



the present invention, there are provided novel DNA 
constructs useful for the introduction of DNA into the 
genome of a transfected organism, said DNA construct 
comprising, as an autonomous fragment: 




(a) 



(b) 



at least one FLP recombination target 
site , 

at least one restriction endonuclease 
recognition site, 



Si 
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(c) 



at least one marker gene, 

a bacterial origin of replication, and 

optionally 

a mammalian cellular or viral origin of 
DNA replication. 

In accordance with yet another embodiment 



(e) 



5 



of the present invention, there are provided novel DNA 
constructs useful for the rescue of DNA from the genome 
of a transfected organism, said DNA construct comprising, 
10 as an autonomous fragment, in the following order, 
reading from 5 f to 3 1 along said fragment: 

(a) a first FLP recombination target site, 

(b) an insert portion comprising, in any 
suitable sequence: 

15 (1) at least one restriction endonuclease 

recognition site, 

(2) at least one marker gene, 

(3) a bacterial origin of replication, 
and optionally 

20 (4) a mammalian cellular or viral origin 

of DNA replication, and 

(c) a second FLP recombination target site in 
tandem with said first FLP recombination 
target site. 

25 In addition, there are provided methods for the recovery 
of transfected DNA from the genome of a transfected 
organism employing the above-described constructs. 

In accordance with still another 
embodiment of the present invention, there is provided a 

30 method for the assembly of a functional gene (which is 

then suitable for activation of expression) , in mammalian 
cells, by recombination of individually inactive gene 
segments derived from one or more gene(s) of interest, 
wherein each of said segments contains at least one 

35 recombination target site, said method comprising: 
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contacting said individually inactive gene 
segments with a FLP recombinase, under 
conditions suitable for recombination to occur, 
thereby providing a DNA sequence which encodes 
5 a functional gene of interest. 

In accordance with a further embodiment of 
the present invention, there is provided a method for the 
disruption of functional gene(s) of interest, thereby 
inactivating expression of such gene(s), in mammalian 
10 cells, wherein said gene(s) of interest contain at least 
one FLP recombination target site, said method comprising 
contacting said gene(s) of interest with: 

(i) a DNA segment which contains at least one 
FLP recombination target site, and 
15 (ii) FLP recombinase; 

wherein said contacting is carried out under conditions 
suitable for recombination to occur between said gene and 
said DNA segment, thereby disrupting the gene(s) of 
interest and rendering said gene(s) non-functional. 
20 In accordance with a still further 

embodiment of the present invention, there is provided a 
method for the precisely targeted integration of DNA into 
the genome of a host organism, said method comprising: 

(i) introducing a FLP recombination target 
25 site into the genome of cells which are 

compatible with the cells of the subject, 

(ii) introducing a first DNA comprising a 
nucleotide sequence containing at least 
one FLP recombination target site therein 

30 into the FLP recombination target site in 

the genome of said cells by contacting 
said cells with said first DNA and FLP 
recombinase, and thereafter 
(iii) introducing the cells produced by the 

35 process of step (ii) into said subject, 
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wherein the resulting cells and/or 
organism have the optional ability to 
have DNA reproducibly and repetitively 
inserted into and/or recovered from the 
host cells and/or organism. 
In accordance with another aspect of the 
present invention, there are provided mammalian cells, 
wherein the genomic DNA of said cells contain at least 
one FLP recombination target site therein. 

In accordance with yet another aspect of 
the present invention, there are provided transgenic, 
non-human mammals, wherein said mammals contain at least 
one FLP recombination target site in the genomic DNA 
thereof . 

In accordance with yet another aspect of 
the present invention, there is provided a method for the 
site-specific integration of transfected DNA into the 
genome of the above-described cells and/or transgenic, 
non-human mammals, said method comprising: 
*(i) contacting said genome with: 

(a) FLP recombinase, and 

(b) a first DNA comprising a 
nucleotide sequence containing 
at least one FLP recombination 
target site therein; and 
thereafter 

(ii) maintaining the product of Step (i) 

under conditions suitable for site- 
specific integration of said DNA 
sequence to occur at the FLP 
recombination target site in said 
genome. 

In accordance with a further aspect of the 
present invention, there is provided a method for the 
analysis of the development of a mammal, said method 
comprising: 
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(a) providing a transgenic mammal comprising: 

(i) an expression construct encoding FLP 
under the control of a conditional 
promoter, and 

5 (ii) a reporter construct under the 

control of the same or a different 
promoter, wherein said reporter 
construct encodes a functional or 
non-functional reporter gene product, 

i0 and wherein said construct contains 

at least one FLP recombination target 
site therein, 

wherein the functional 
expression of the functional reporter 

!5 gene is disrupted when said FLP 

recombination event occurs, or 

wherein the functional 
expression of the non-functional 
reporter gene commences when said FLP 

2 0 recombination event occurs; and 

(b) following the development of said mammal to 
determine when expression of functional reporter gene 
product either commences or is disrupted. 

In accordance with a still further aspect 
25 of the present invention, there is provided a co- 

transfection assay FLP-mediated recombination, said assay 
comprising: 

(a) co-transfecting a host mammalian cell with: 

(i) a FLP expression plasmid, and 
30 (ii) a reporter plasmid comprising a 

reporter gene inactivated by the presence 
of at least one recombination target site; 
and 

(b) monitoring said host cell under a variety 
35 of conditions for the gain of expression of functional 

reporter gene product. 
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FLP recombinase is a protein which 



catalyzes a site-specific recombination reaction that is 
involved in amplifying the copy number of the 2/i plasmid 
of S.cerevisiae during DNA replication. FLP protein has 
5 been cloned and expressed in E.coli [see, for example, 
Cox, in proceedings of the National Academy of Sciences 
U.S.A. , Vol. 80: 4223-4227 (1983)], and has been purified 
to near homogeneity [see, for example, Meyer-Lean, et 
al., in Nucleic Acids Research, Vol. 15: 6469-6488 

10 (1987)]. FLP recombinases contemplated for use in the 
practice of the present invention are derived from 
species of the genus Saccharomyces . Preferred 
recombinases employed in the practice of the present 
invention are derived from strains of Saccharomyces 

15 cerevisiae. Especially preferred recombinases employed 
in the practice of the present invention are proteins 
having substantially the same amino acid sequence as set 
forth in Sequence I.D. No. 2, as encoded, for example, by 
Sequence I.D. No. 1, or the sequence set forth by Hartley 

20 and Donelson, Nature 286: 860 (1980) . 



(sometimes referred to herein as "FRT" ) has also been 
identified as minimally comprising two 13 base-pair 
repeats, separated by an 8 base-pair spacer, as follows: 



^<5^The nucleotides in the above "spacer" region can be 

replaced with any other combination of nucleotides, so 
long as the two 13 base-pair repeats are separated by 8 
nucleotides. The actual nucleotide sequence of the 
35 spacer is not critical, although those of skill in the 
art recognize that, for some applications, it is 
desirable for the spacer to be asymmetric, while for 



The FLP recombination target site 



25 



-Spacer- 

5 » -GAAGTTCCTATTC [ TCTAGAAA ] GTATAGGAACTTC-3 1 




Xbal 
site 
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other applications, a symmetrical spacer can be employed* 
Generally, the spacers of the FLP recombination target 
sites undergoing recombination with one another will be 
the same . 

5 As schematically illustrated in Figure 1A, 

contact of genomic DNA containing a FLP recombination 
target site (shown as the linear Psv- BETA-GAL construct) 
with a vector containing a FLP recombination target site, 
in the presence of the protein, FLP recombinase, results 

10 in recombination that forms a new genomic sequence 
wherein the vector sequences have been precisely 
incorporated into the genome of the host. The reverse of 
this process is shown schematically in Figure IB, wherein 
a genomic sequence or construct containing two tandemly 

15 oriented FLP recombination target sites, upon contacting 
with FLP, is recombined and the FLP recombination target 
site-bounded fragment is excised as a circular molecule. 

Genes of interest contemplated for use in 
the practice of the present invention can be selected 

20 from genes which provide a readily analyzable functional 
feature to the host cell and/or organism, e.g., visible 
markers (such as /3-galactosidase, thymidine kinase, 
tyrosinase, and the like) , selectable markers, (such as 
markers useful for positive and negative selection, e.g., 

25 genes for antibiotic resistance) , as well as other 

functions which alter the phenotype of the recipient 
cells, and the like. 

The first DNA employed in the practice of 
the present invention can comprise any nucleotide 

30 sequence containing at least one FLP recombination target 
site, which will precisely define the locus at which FLP- 
mediated recombination will occur. The nucleotide 
sequence can comprise all or part of a gene of interest, 
as well as other sequences not necessarily associated 

35 with any known gene. Optionally, for ease of later 
recovery of the gene of interest (in "activated" or 
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modif ied form) , the first DNA can optionally contain a 
second FLP recombination target site. 

The second DNA employed in the practice of 
the present invention is selected from at least a second 
portion of the first gene of interest or at least a 
portion of a second gene of interest (including an intact 
form of a second gene of interest) . When the second DNA 
is at least a second portion of the first gene of 
interest, the site-specific recombination of the present 
invention may act to provide a functional combination of 
the different portions of the first gene of interest. 
Alternatively, when the second DNA is at least a portion 
of a second gene of interest, the site-specific 
recombination of the present invention may act to provide 
a functional hybrid gene, which produces a product which 
is not identical with either the product of the first 
gene or the second gene. As yet another alternative, 
when the second DNA is a portion of a second gene, the 
site-specific recombination of the present invention may 
act to disrupt the function of the first gene of 
interest. Based on the nature of the first DNA and the 
second DNA, as well as the mode of interaction between 
the two, the site-specific interaction of the present 
invention may create or disrupt a feature which is 
colorimetrically detectable, immunologically detectable, 
genetically detectable, and the like. 

In accordance with the present invention, 
assembly of a functional expression unit is achieved in 
any of a variety of ways, e.g., by association of the 
gene of interest with a functional promoter, by assembly 
of common gene fragments to produce a complete functional 
gene (which, in combination with its promoter, comprises 
a functional expression unit) , or assembly of diverse 
gene fragments from diverse sources to produce a 
functional, hybrid gene (which, in combination with a 
promoter, comprises a functional expression unit) , and 
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the like. Upon assembly of a functional expression unit 
as described herein, expression of the functional gene to 
produce a protein product can be activated in the usual 
manner. In the absence of FLP-mediated recombination, 
5 activation of expression would fail to produce a 
functional protein product. 

In accordance with the present invention, 
dis-assembly of a functional expression unit is achieved 
in any of a variety of ways, e.g., by dis-associating the 

10 gene of interest from a functional promoter, by dis- 
assembly (e.g., disruption) of the functional gene (e.g., 
by introduction of DNA which renders the entire sequence 
non-functional) , by removal of a substantial portion of 
the coding region of said gene, and the like. Upon dis- 

15 assembly of a functional expression unit as described 

herein, expression of the functional gene product under 
the conditions required prior to gene dis-assembly is no 
longer possible. The ability of the expression unit to 
be activated for expression has therefore been disrupted. 

20 The gene in this situation can be said to be inactivated, 
since activation of expression is not possible. 

Individually inactive gene segments 
contemplated for use in the practice of the present 
invention are fragments which, alone, do not encode 

25 functional products. Such fragments can be derived from 
a first gene of interest alone, or from both a first and 
second gene of interest DNA fragments. 

When gene inactivation is desired, the 
gene of interest can be disrupted with a DNA fragment 

3 0 which throws the gene of interest out of reading frame 

(e.g., an insert wherein the number of nucleotides is not 
a multiple of 3) . Alternatively, the gene of interest 
can be disrupted with a fragment which encodes a segment 
which is substantially dissimilar with the gene of 

3 5 interest so as to render the resulting product non- 
functional. As yet another alternative, the gene of 
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interest can be disrupted so as to dis-associate the gene 
of interest from the transcriptional control of the 
promoter with which it is normally associated. 

The introduction of DNA, e.g., DNA 
5 encoding FLP recombination target sites, into the genome 
of target cells can be accomplished employing standard 
techniques, e.g., transf ection, microinjection, 
electroporation, infection with retroviral vectors, and 
the like. 

10 Introduction of protein, e.g., FLP 

recombinase protein, to host cells and/or organisms can 
be accomplished by standard techniques, such as for 
example, injection or microinjection, transfection with 
nucleotide sequences encoding FLP, and the like. 

15 When employed for gene therapy of an 

intact organism, introduction of transgenic cells into 
the subject is accomplished by standard techniques, such 
as for example, grafting, implantation, and the like. 

Mammalian cells contemplated for use in 

20 the practice of the present invention include all members 
of the order Mammalia, such as, for example, human cells, 
mouse cells, rat cells, monkey cells, hamster cells, and 
the like. 

Host organisms contemplated for use in the 
25 practice of the present invention include each of the 
organism types mentioned above, with the proviso, 
however, that no claim is made to genetically modified 
human hosts (although the present invention contemplates 
methods for the treatment of humans) . 
30 Once FLP recombinase (or DNA encoding 

same) and DNA containing at least one FLP recombination 
target site have been introduced into suitable host 
cells/organisms, the cells/host organisms are maintained 
under conditions suitable for the site-specific 
35 recombination of DNA. Such conditions generally involve 
conditions required for the viability of the host cell or 
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organism. For in vitro manipulations, conditions 
employed typically involve low concentrations of a 
variety of buffers having a pH of between about 5-9 and 
ionic strengths in the range of about 50-350 mM. See, 
5 for example, Senecoff, et al., in Journal of Molecular 
Biology. Vol. 201 : 405-421 (1988). 

In accordance with a particular aspect of 
the present invention, a co-transf ection assay has been 
developed which can be used to characterize FLP-mediated 
10 recombination of extrachromosomal DNA in a variety of 

cell lines. Cells are co-transf ected with an expression 
construct and a "reporter" plasmid that is a substrate 
for the recombinase. The expression construct encodes a 
FLP recombinase protein. The reporter plasmid encodes 
15 either a functional reporter gene containing at least one 
recombination target site therein, or a non-functional 
reporter gene containing at least one recombination 
target site therein. Upon expression of FLP by the 
expression construct, the functional reporter gene will 
2 0 be rendered non-functional, or the non-functional 

reporter gen'e will be rendered functional. Thus, the 
activity of the expression construct can be assayed 
either by recovering the reporter plasmid and looking for 
evidence of recombination at the DNA level, or by 
2 5 preparing cytoplasmic extracts and looking for evidence 
of recombination at the protein level (i.e., by 
measuring the expression of reporter gene activity 
generated by the recombined reporter) . Such assays are 
described in greater detail in Example 1 below. 
30 The invention will now be described in 

greater detail by reference to the following non-limiting 
examples . 
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EXAMPLES 



Example 1 — Co-transf ection Assays. 
The co-transf ection assay used to 



characterize FLP-mediated recombination of 
extrachromosomal DNA involved transfection of cells with 
an expression construct and a "reporter" plasmid that was 
a substrate for the recombinase. The activity of the 
expression construct could be assayed either by 
recovering the reporter plasmid and looking for molecular 
evidence of recombination at the DNA level, or by 
preparing cytoplasmic extracts and looking for evidence 
of recombination at the protein level (i.e., by measuring 
0-galactosidase activity generated by recombined 
reporter) . 



these assays was derived from pFRT/?GAL (Fig. 2 A) . In the 
Figure, half-arrows indicate positions of FLP 
recombination target (FRT) sites; E and S designate EcoRI 
and Seal restriction sites, respectively; Psv designates 
early promoter from SV4 0; BETA-GAL designates the /3- 
galactosidase structural sequence; NEO designates 
neomycin expression cassette; Pcmv designates the 
cytomegalovirus immediate early promoter; IN designates 
an intron; FLP designates a FLP coding sequence; AN 
designates an SV40 adenylation cassette; thin lines 
represent vector sequences; and the sizes of restriction 
fragments are indicated in kb. 



bacterial /?-galactosidase sequence modified by insertion 
of a FLP recombination target site, or FRT, within the 
protein coding sequence immediately 3 1 to the 
translational start. The oligonucleotide used for the 
construction of pFT/3RGAL was: 



The pNE0/3GAL reporter plasmid used for 



pFRT/3GAL contains a version of the 



5 1 -GATCCCGGGCTACCATGGA » GAAGTTCCTATTC * C GAAGTTCCTATTC 
( TCTAGA) AAGTATAGGAACTTC A-3 » . 
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This oligonucleotide contains an in-frame start codon, 
minimal FRT site, and an additional copy of the 13-bp FRT 
repeat [ ° XXX ° ] ; the Xbal site within the FRT spacer is 
enclosed in parentheses. The linker was inserted between 
the BamHI and Hindlll sites of pSKS105 (M.J. Casadaban, 
A. Martin-Arias, S.K. Shapira, and J. Chou, Mettu EnzymoL 
100, 293 (1983)) and the LacZ portion of modified gene 
was cloned into a pSV2 vector. The neomycin cassette 
used for construction of pNEO/3GAL was an Xhol to BamHI 
fragment from pMClneo-polyA (K. Thomas and M. Capecchi, 
Cell 51:503 (1987)) cloned between copies of the J3 FRT 
site in pUCl9. 



pair (bp) repeats and an 8-bp spacer that together 
comprise the minimal FRT site, plus an additional 13-bp 
repeat which may augment reactivity of the minimal 
substrate. The /3-galactosidase translational reading 
frame was preserved upon insertion of the FRT site, and 
the resulting plasmid, pFRT/3GAL, generated robust 
activity in mammalian cells (Table 1) . 



pFRT^GAL in the middle of the FRT site with Xbal and then 
inserting an Xbal fragment consisting of two half-FRT 
sites flanking a neomycin transcription unit. This 
created intact FRTs on either side of the neomycin 
cassette and rendered the /3-galactosidase transcription 
unit inactive (Table 1) . Precise FLP-mediated 
recombination of the FRTs caused the excission of the 
neomycin cassette, recreated the parental pFRT/JGAL 
plasmid, and restored /?-galactosidase expression. 



amount of pNE0/3GAL reporter plasmid and increasing 
amounts of the pOG4 4 FLP expression vector generated 
increasing amounts of recombined reporter plasmid and 
consequently, increased levels of /3-galactosidase 
activity. Molecular evidence for FLP-mediated 



The FRT consists of two inverted 13-base- 



pNEO/?GAL was constructed by cutting 



Co-transf ection of cells with a fixed 
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recombination was obtained by recovering plasmids 36 
hours after transfection, followed by endonuclease 
treatment (with EcoRI and Seal) and Southern blotting 
(see Fig. 2B; employing as a probe the fragment of 
pFRT/JGAL indicated at the top of Fig. 2 A) . Lysates of 
cells from cotransf ections that included the pOG44 FLP 
expression vector showed a signal at 5.6 kb, the position 
at which recombined reporter (equivalent to pFRT^GAL) 
would run, and a 3 . 2 kb signal that was generated by 
unrecombined pNEO/?GAL reporter (Fig. 2A) . The 5.6 kb 
band intensity was proportional to the amount of FLP 
expression plasmid included in the transfection. The 5.6 
kb band was not seen in cotransf ections in which a non- 
FLP plasmid was substituted for the FLP expression vector 
(Fig. 2B) or in transf ections that contained only pOG44 
(and no reporter plasmid) . pOG44 generated additional 
signals at 2 . 2 kb and 2.8 kb because the plasmid used in 
its construction contained EcoRI and EcoRI-Scal fragments 
of such length. 

pOG4 4 consists of the cytomegalovirus 
immediate early promoter from pCDM8 [see Aruffo and Seed 
in Proc. Natl Acad. Sci. , USA 84:8573 (1987)], a 5 1 
leader sequence and synthetic intron from pMLSIScat [see 
Huang and Gorman in Nucl. Acids Res. 1JB: 937 (1990)], the 
FLP coding sequence (bp 5568-6318 and 1-626 of the 2/xm 
circle, [see Hartley and Donelson, Nature 286 ; 860 
(1980)] and the SV40 late region polyadenylation signal 
from pMLSIScat. The following silent nucleotide 
substitutions were introduced into the structural FLP 
sequence using the polymerase chain reaction: C for T at 
position 5791, G for A at 5794, G for C at 5800, C for T 
at 55, G for A at 58, and C for T at 103. These changes 
eliminated three cannonical AATAAA polyadenylation 
signals and introduced a PstI restriction site without 
altering the amino acid sequence encoded by the 
nucleotide sequence. pOG28 consists of a murine cDNA for 
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dihydrof olate reductase cloned into pCDM8 (Aruffo and 
Seed, supra) . 

In the same samples, /3-galactosidase 
activity was also proportional to the amount of FLP 
5 expression plasmid included (Fig* 2C) . Only background 
activities were observed in cotransf ections that included 
a non-FLP control plasmid (Table 1) or when pOG44 alone 
was transfected. The experiment thus provides both 
molecular and biochemical evidence for precise FLP- 

10 mediated recombination in mammalian cells. 

Table 1 presents £-galactosidase 
activities in cotransf ection assays of 293, CV-1, and F-9 
cells. Positive control transf ections (pFRT/?GAL) 
included 1 /xg of pFRT/3GAL and 18 jug of the pOG28 non-FLP 

15 control plasmid; negative control transf ections 

(pNEO/3GAL) included 1 tig of pNEO/3GAL and 18 ixg of the 
pOG28; and experimental transf ections (pNE0/3GAL + FLP) 
contained 1 jig of pNE0/?GAL and 18 ^g of the p0G44 FLP 
expression plasmid (Fig. 1A) . The column headed by "%" 

2 0 shows the pNEO/?GAL + FLP values as a percentage of the 
pFRT/?GAL positive control. Each value represents the 
mean for six plates from two experiments. Standard 
errors are in parentheses. Neither pOG28 nor pOG44 
generated /3-galactosidase activity when transfected 

25 alone. All transf ections contained 1 fig of pRSVL [de Wet 
et al., Mol. Cell. Biol. 7: 725 (1987)] to correct /3- 
galactosidase activities for relative transfection 
efficiencies . 

Subconfluent cultures of cells in 10 cm 
30 dishes and grown in Dulbecco's modified Eagle's medium 
(DMEM) and 5% calf serum were transfected by overnight 
exposure to calcium phosphate precipitates [Graham et 
al., Virology 36:59 (1979)] and then split 1:4. After 24 
hours incubation, one plate of each transfection was 
35 harvested by Hirt extraction [J. Mol Biol. 26:365 (1967)] 
and a second plate was used to prepare cytoplasmic 
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extracts [de Wet et al., supra 1 . Approximately 5% of the 
DNA recovered from single plates was used for Southern 
analyses. /3-galactosidase assays were performed as 
described by Hall et al., in J. Mol. Appl. Genet. 2:101 
(1983)]. Luciferase activities generated by the 
inclusion of 1 ixq of pRSVL (de Wet et al . , supra ) in all 
transf ections were used to correct /3-galactosidase 
activities for relative transfection efficiencies. The 
experiment was repeated twice with similar results. 

TABLE 1: /3-GALACTOSIDASE ACTIVITIES (UNITS/ MG PROTEIN) 
IN COTRANSFECTED CELLS 



CELL LINE 




TRANS FECTIONS 




% 




pFRT/3GAL 


pNEO/?GAL 


pNEO/3GAL 
+ FLP 




293 


30.4 (1.9) 


0.17 (0.02) 


14.2 (2.2) 


47 


CV-1 


275 (25) 


0.33 (0.06) 


22.6 (1.2) 


8 


F9 


24.8 (4.3) 


0.04 (0.01) 


1.88 (0.02) 


8 



30 FLP activity has also been demonstrated in 

monkey kidney (CV-1) and mouse embryonal carcinoma (F9) 
cells. In Table 1, the /?-galactosidase activity in the 
"pFRT^GAL" transf ections represents an estimate of the 
expression expected if all the pNEO/?GAL in a co- 

3 5 transfection were immediately recombined. The highest j3- 
galactosidase expression in co-transf ections employing 
pNEO/3GAL plus pOG44, relative to pFRT/3GAL transf ected 
cells, was 47%, seen in 293 cells. This is a remarkable 
level considering that /3-galactosidase expression 

40 required both FLP expression, followed by recombination 

of pNEO/3GAL, to produce a construct capable of expressing 
/3-galactosidase . Co-transf ections of CV-1 and F9 cells 
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generated 8% of the activity seen in the pFRT/?GAL 
transfections. Even at this lower relative activity, 
cotransf ected cells were readily observed in 
histochemical reactions for 0-galactosidase activity. 



Example 2 — FLP-Mediated Removal of 

Intervening Sequences 
If the invention method is to be widely 



applicable, for example for gene activation in transgenic 
10 mammals, the ability of FLP to faithfully promote precise 
recombination at FLP recombination target sites contained 
in the mammalian genome is required. Such ability is 
demonstrated in this example. 



15 copies of pNEO/3GAL (designated CVNE0/3GAL/E) were produced 
by transfecting CV-1 cells with linearized plasmid by 
electroporation, then isolated by selecting G418- 
resistant (G418 R ) transf ectants that stably expressed the 
neomycin cassette, and finally identifying single copy 

20 lines by Southern blot analyses (Fig. 3) . As previously 
shown for other integrated constructs with similarly 
short direct repeats, the chromosomal FRTs did not 
spontaneously recombine (in the absence of FLP) to 
produce a /J-galactosidase-positive (0GAL + ) phenotype at 

25 detectable frequencies (Table 2) . 



CVNEO/JGAL/E lines (by transiently transfecting with the 
pOG44 FLP expression vector) promoted a rapid conversion 
to a /3GAL* phenotype. When five different lines were 

30 transiently transfected with the pOG44 FLP expression 

vector, /?-galactosidase activities at 36 hours were 40 to 
100-fold higher than those seen in replicate plates 
transfected with a non-FLP plasmid (Table 2) . At 48 
hours after transfection histochemical processing showed 

35 many positive cells (Table 2) . 



Cell lines that contain single integrated 



Transient expression of FLP in the 
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Table 2 presents the 0-galactosidase 
phenotypes of CVNEO/3 GAL/ E lines, which contain a single 
copy of the 0-galactosidase inactive reporter, pNEO/?GAL, 
after transfection with FLP expression (pOG44), non-FLP 
5 negative control (pOG28) or /3-galactosidase positive 

control (pFRT/?GAL) plasmids. The pFRT/3GAL transf ections 
included Ipg of pFRT/3GAL and 19/xg of pOG44; other mixes 
contained 20/ig of the indicated plasmid. /?-galactosidase 
activities are mean values for triplicate transf ections 

10 performed as described for Fig. 2 and assayed 36 hours 
after removal of precipitates; standard errors for the 
pOG44 transf ections were less than 10% of the mean. The 
percent positive was determined by scoring more than 10 3 
cells after transfection and histochemical processing as 

15 described by de Wet et al., supra . 



TABLE 2 



0-GALACTOSIDASE PHENOTYPES OF 
\\ "a * 1 TRANSFECTED CVNEO0GAL CELL LINES 

2 0 

CELL ACTIVITIES PERCENT POSITIVE 

LINE (units/mg protein) 



25 



pOG28 pOG4 4 pOG28 pFRT/3GAL pOG44 



E6 0.24 11.2 Of 8.7 6.1 

30 E25 0.21 16.7 Of 17.1 12.4 

E26 0.18 7.2 Of 19.5 15.4 

E14 0.28 13.1 ND ND ND 

E22 0.09 9.6 ND ND ND 

35 fN° positive cells were found among >10 6 cells examined. 
ND: Not done. 
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To provide some estimate of the efficiency 
of recombination, an additional set of replicate plates 
were transfected with the pFRT/?GAL /?-galactosidase 
expression vector. Comparing the fractions of cells that 
5 were /3GAL+ in the pFRT/?GAL and in the pOG4 4 transf ections 
(assuming similar transfection efficiencies) suggests 
that most (70-80%) of the cells transfected with pOG44 
were converted to a 0GAL + phenotype (Table 2) . The 
comparison undoubtedly underestimates the efficiency of 

10 FLP-mediated excision. Whereas many copies of a 

functional 0-galactosidase gene were available for 
immediate transcription in the positive controls, 
recombination may have occurred shortly before harvest in 
some pOG44-transf ected cells. In these cases the single 

15 recombined reporter gene may not have generated enough 
/3-galactosidase by the time of harvest to render the 
cells positive in this assay. 

The /3GAL + phenotype was passed on to all 
descendents of many FLP-converted cells. Positive 

20 colonies were formed during prolonged expansion of 

individual colonies. Entirely negative colonies and 
mixed colonies were also observed. Mixed colonies would 
be expected if recombination occurred after mitosis in 
only one descendent of a transfected cell, or if 

25 recombined and unrecombined cells mixed at replating or 
during subsequent growth. Indeed, the physical 
segregation of phenotypes evident in most mixed colonies 
suggested that they were composed of stably positive and 
negative lineages . 

30 The correlation between /?-galactosidase 

expression and recombination at FRT sites was examined by 
comparing the structure of the integrated pNEO/?GAL 
sequences in two /3GAL + subclones to the parental line. 
CVNEO/JGAL/E2 5 (106) cells were transfected with the pOG44 

3 5 FLP expression vector and subcloned 12 hours after 
removal of the precipitate. After histochemical 
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screening, two /3GAL* subclones (E25B1 and E25B2) were 
expanded for further analysis. In Southern blots of 
genomic DNA from both subclones, the pattern of 
hybridization matched that expected for FLP-mediated 
5 recombination of the FRT sites in the parental line (Fig, 
3). While recombination products have not been recovered 
and sequenced, these Southern analyses and the fact that 
activation of /3-galactosidase expression required 
creation of a functional translational reading frame 
10 indicate that FLP-mediated recombination was precise. 

Example 3 -- FLP Mediated Recombination 

of FRT on an 

Extrachromosomal Molecule 
15 With a Chromosomally 

Integrated FRT. 

Reversal of the process described in the 
previous Example, i.e., the FLP-mediated recombination of 

2 0 an FRT site on a plasmid with a chromosomally integrated 

FRT site, can be used to target the integration of 
transfected plasmids to specific genomic sites. To 
determine the frequency at which this occurs, G418- 
sensitive, /?GAL + E25B2 cells were co-transf ected with the 

25 pOG44 FLP expression vector and a plasmid, pOG45, that 

contained a neomycin resistance gene expression cassette 
and a single FRT. pOG45 consisted of the neomycin 
resistance cassette and 3 1 FRT from pNEO/?GAL cloned into 
pUC19. 8 x 10 5 CVNEO/3GAL cells were transfected by 

30 electroporation in 800 /il of saline containing 40 fig of 
pOG44 and 0.1 of either pOG45 or, for a negative 
control, pOG45A (which was derived from pOG45 by deleting 
a 200 bp fragment containing the FRT) . 

G418 R subclones (designated B2N) from three 

3 5 transf ections that had stably integrated pOG45 were 

histochemically stained for /3-galactosidase activity and 
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more than half (104 of 158, or "66%) were either entirely 
/?-galactosidase-negative (/JGAL* ) or predominantly /3GAL" 
with a few clusters of /?GAL* cells • The remaining 
colonies were /?GAL + . With continued passage as dispersed 
5 monolayers, the fraction of jSGAL* cells in the mosaic 

lines rapidly diminished. This suggested they were G418" 
sensitive cells that initially survived because of their 
proximity to resistant cells; this was confirmed by 
reconstitution experiments. All of the 55 colonies 
10 formed after parallel co-transf ections of pOG44 and a 
derivative of pOG4 5 (pOG4 5A) that lacked an FRT were 
/?GAL + . 

The correlation between loss of 
0-galactosidase activity and recombination between 
15 plasmid and chromosomal FRTs was examined in Southern 

analyses. Because the FRT and neomycin cassette of pOG45 
were derived from the neomycin cassette and 3 ' FRT of 
pNEO/?GAL (Fig. 2 A) , recombination of the plasmid FRT with 
the E2 5B2 chromosomal FRT regenerates the 3.2 kb EcoRI 

2 0 fragment of the original CVNEO/?GAL/E2 5 parent. 

Additionally, the 8.5 kb junctional fragment of 
CVNEO/3GAL/E2 5 shifts to 12.0 kb because pOG45 is 3.5 kb 
larger than the neomycin cassette of pNE0/3GAL. The 3.2 
kb EcoRI fragment and the 8.5 kb junctional fragment were 

25 observed in each of the 10 cell lines analyzed after 

initial histochemical classification as /?GAL" or mosaic, 
as shown for two such lines in Fig. 3B. In contrast, 
each of the four jSGAL+ colonies examined by Southern 
analyses showed that pOG4 5 had integrated at a random 

30 site. 

These data show that FLP-mediated 
recombination will target the integration of transfected 
DNA to a specific chromosomal site at frequencies that 
exceed those of random integration, and that the event 

3 5 can be marked by the alteration in gene activity at the 

target site. The efficiency of targeted integration can 
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be increased by standard optimization techniques, such as 
for example, by using ratios of the integrating plasmid 
and FLP expression vectors different from the single 
ratio mixture used here, or by using FRT mutations in the 
5 plasmid and chromosomal sites to decrease the frequency 
with which successfully integrated plasmids are 
subsequently excised . 

While the invention has been described in 
detail with reference to certain preferred embodiments 
10 thereof, it will be understood that modifications and 

variations are within the spirit and scope of that which 
is described and claimed. 
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SUMMARY OF SEQUENCES 

Sequence I.D. No. 1 is the approximately 
14 50 base-pair sequence encoding a FLP recombinase 
5 contemplated for use in the practice of the present 
invention, as well as the amino acid sequence deduced 
therefrom. 

Sequence I.D. No. 2 is the amino acid 
sequence deduced from the nucleotide sequence of Sequence 
10 ID No. 1. 



